SMART Frame Selection for Action Recognition
نویسندگان
چکیده
Video classification is computationally expensive. In this paper, we address theproblem of frame selection to reduce the computational cost video classification.Recent work has successfully leveraged for long, untrimmed videos,where much content not relevant, and easy discard. work, however,we focus on more standard short, trimmed problem. Weargue that good can only videoclassification but also increase accuracy by getting rid frames are hard toclassify. contrast previous propose a method instead selectingframes considering one at time, considers them jointly. This results in moreefficient selection, where “good" effectively distributed over thevideo, like snapshots tell story. We call proposed SMARTand test it combination with different backbone architectures multiplebenchmarks (Kinetics [5], Something-something [14], UCF101 [31]). showthat SMART consistently improves compared toother strategies while reducing factorof 4 10 times. Additionally, show when primary goal recognitionperformance, our strategy improve recent state-of-the-art modelsand various benchmarks (UCF101, HMDB51 [21],FCVID [17], ActivityNet [4]).
منابع مشابه
Information Theoretic Key Frame Selection for Action Recognition
This paper presents an approach for human action recognition by finding the discriminative key frames from a video sequence and representing them with the distribution of local motion features and their spatiotemporal arrangements. In this approach, the key frames of the video sequence are selected by their discriminative power and represented by the local motion features detected in them and i...
متن کاملBoosted key-frame selection and correlated pyramidal motion-feature representation for human action recognition
In this paper we propose a novel method for human action recognition based on boosted key-frame selection and correlated pyramidal motion feature representations. Instead of using an unsupervised method to detect interest points, a Pyramidal Motion Feature (PMF), which combines optical flow with a biologically inspired feature, is extracted from each frame of a video sequence. The AdaBoost lear...
متن کاملMulti-frame Super Resolution for Improving Vehicle Licence Plate Recognition
License plate recognition (LPR) by digital image processing, which is widely used in traffic monitor and control, is one of the most important goals in Intelligent Transportation System (ITS). In real ITS, the resolution of input images are not very high since technology challenges and cost of high resolution cameras. However, when the license plate image is taken at low resolution, the license...
متن کاملLearning discriminative features for fast frame-based action recognition
In this paper we present an instant action recognition method, which is able to recognize an action in real-time from only two continuous video frames. For the sake of instantaneity, we employ two types of computationally efficient but perceptually important features – optical flow and edges – to capture motion and shape characteristics of actions. It is known that the two types of features can...
متن کاملHuman Action Recognition in Smart Classroom
This paper presents a new system for teachers’ natural complex action recognition in smart classroom in order to realize intelligent cameraman and virtual mouse. First, the system proposes a hybrid human model and employs 2-order B-spline function to detect the two shoulder joints in the silhouette image to obtain the basic motion features including the elbow angles, motion parameters of the fa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i2.16235